OOMELl-Sequence Data on CRF.WorkFile

Individual Applicant



Street : Specklesteinstrasse 55 
City : Eukirchen 
State : 

Country : Germany 

Postal Code : 

PhoneNumber : 

FaxNumber : 

Email Address : 
<110> LastName : Happe 
<110> FirstName : Thomas 
<110> MiddleInitial :
<110> Suffix : 

Application Project 



<120> Title : Hydrogen Production 

<130> AppFileReference : OlMELl 

<140> CurrentAppNumber : 10/077,699 

<141> CurrentFilingDate : 2002-02-15

Sequence 



<213> OrganismName : Scenedesmus obliquus 
<400> PreSequenceString :
gttgctgttg ctgttctaga acaatccata cacacgatta gattgagctc accttcagct 
cacggaaaat tcttcaggcc tcaacccttc agctccaccc tgcctttctg gaaaaatgca 
ctcgtggctc tacagggtga gcaaccaggg gcgcaactgc agggcatgct catacagaac 
atgctgccgc agctgatcat cgctcagcag tgcagtcaag ctgcgcactg gcagcttgca 
ttgtagctgg tgtacaacat tccagaagcc gactggtatt cgttgcaatt gtcacaattg 
tgacgcccat gcaaggccca cgagcaatat cgactgcaga accctgtgct gggatctacg 
ggaatgattg gattggacga tgtcagggcg ttcgacagca ccgtaccaaa gcttgccaaa 
ctttagcagc ggctgctagc aaccacgaga taagccatgg ccacaacctt gcaacatcgc 
gcatctgcag ccgccgatgc atgcaaggtc ggtgttgtgc ggttcctgct tgctctgctt 
caggcaacac agcctccagg tgttcaactt gaaggtgtga caccactggt gtgctggcag 
ctggccattc ggtttaagcc aagcagtaca gcgctgtcag cttcatcccc gcctggttac 
tgtgatgtat gtgcttctga tcaagcggtc ctccatgccg tccgaacaga actgcgctgt 
aagcttacgc agccccaacc ggctccgagc agcatgccct taagtggcgg gaaaactgcc 
agggacggtg taagggcgcc attcagcgct cgatactgta agattgtttt agatgaaaca 
gaaatacacc tccggagctg cgagtagcga ggtgattttg cataagggat ccacactgtt 
gtgggcgcac gtccaagaaa tgtttacccg tttcgattga cagcaaaaca tcatgatcat 
caaaggagtg catcgacagt caacgatcac caggtgatta cgtttgtcac tgacaagcgc 
cctctacgtg cgccttgggc ctacatatgc cctgctgtgg gagtacccgt gcacaacaga 
gcgttagaga tacttcatag ctgcaactag actaccttta ccctaacgaa atcaccctag 
accgacagtg tcggagtagc tgcgacccaa acgtgatggc gagcggattg cttctcaagc 
agcgctcggt atgcctgagt ggcaaccggg aggtcggtat gctgtttctg tccgcccgcc 
agtgaacagg cgggctgtgg tggcagcagg tgcgcttctt ctgaagggca gctagggctg 
tttcgggcag tgcatgccgg cctattttgg gttgctcgga gcaataatat gtactatatt 
gctctcgtgg agctgtgttg cgccacgtgc ttgccttggc gcctgttgac cccggaccct 
ccacgttgct tcttgccgct gcagagcgca ggcgccttgt tgtgcgggca gctggcccaa 
cagcagaatg tgattgccca ccagctcccg cgcccaaggc cccgcactgg cagcagacgc 
tagatgagct aggtgagctg cgtgacattg gaagtctggt gtccgcaact gctctctgtg 
catgctgaca tccggaatca agtgccaaga agcagggctc gtgtgggtca tttgtgggca 
ggtttgcagc agcttgccgt gttcaagcag cagcatgtgg gctgacacat actgctgccg 
tgcttctgct gtcctgcagc caagcctaag gagcagcgca aggtgatgat cgcccagatc 
gcaccagcag tgcgcgtggc tattgcagag accatgggac tcaaccctgg ggatgtgaca 
gttggccaga tggtgaccgg cctgcgcatg ctgggctttg attatgtgtt tggtgagtta 
cacagtgttt agtgctgcag cagtccagag cagcttgtgc tagttgatgt tgatcctttg 
ggcctgggat atccagctgg acgtcttaca ctgttttttt agcgtccgga gtgggctagt 
caacaacagt gagcgctgta tcatgtggtt tgttcatgcg tgcgtcgcat gcatgtggcc 
taaccagctg ctgccagcgt gtgcatgtgc ttggtgctgt tttggtgctt ggctggtgag 
cagccgcttt ctgtgtttat gtttggctcc tgttccatgc atgttctttg ttctgctgtg 

actcatctac tgctgctgct ggtgcatctg ctgcttgcag acacgctgtt tggtgctgac 2280 

ctcaccatca tggaggaggg cacagagcta cggcacaggc ttcaggtcag tggtgatggt 2340 

gtactgctgt gttcattatg ccatgaggga ctttggtgtt gccatcaaca gctcacactt 2400 

gtagttactg gcggtagctg cagcgacagg tggatgcata tcctgcagca catatcctgc 2460 

agcaggcagc agcattcatg catgcatccc tttgctcccc tgtctccttg tgctgacagt 2520 

gctgcacact agcgccagcc acaccaggga tgtcgataac aatcagtctg atgtcatcca 2580 

cggtgtttta aaacacatct cttgctgctt gctgcttgca ggaccacctg gagcagcacc 2640 

ccaacaagga ggagccgctg cccatgttca ccagctgctg ccctggctgg gtggccatgg 2700 

tggagaagtc caaccccgag ctcatcccct acctgtcttc ctgcaagtcg ccccagatga 2760 

tgctgggcgc agtcatcaag aactacttcg ctgccgaggc cggcgccaag cctgaggaca 2820 

tctgcaacgt gagcgtgatg ccctgcgtgc gcaagcaggg cgaggctgac cgcgagtggt 2880 

tcaacaccac aggggctggc ggcgcgaacg tggaccacgt catgacaact gcagagctgg 2940 

gcaagatctt tgtggagcgc ggaatcaagc tgaacgacct gcaggagtcg ccctttgaca 3000 

accccgtcgg cgagggcagc ggcggcggcg tgctgttcgg caccactgga ggcgtgatgg 3060 

aggcggcgct gcgcaccgtg tacgaagtgg tgagtgtcag tgtggcggca gctgtgggtt 3120 

gtatcgcagc agcagtttgc gcatttggca gtagtgcagc atgtgctggc atgcgcagag 3180 

ttgcgcccac ctgtgttgga tgtgagctgg gtttgcatag aggcgccatc tgcagaagcg 3240 

tgcacttctg catactgctg ctgctgatct actgccttgc ctttcaccat acccgccacc 3300 

cgtaataatc tctcctgctg cactagccct agacagtgcc agacgttgac gctttctgct 3360 

gccgtgttgt gcatctccac gcacctctgc tgcaccgcag gtcacacaga agcctttgga 3420 

ccgcatcgtc tttgaggacg tgcgcggcct ggagggcatc aaggagtcca cgctgcacct 3480 

caccccaggc cccaccagcc ccttcaaggc ctttgcaggc gcagacggca ccggcatcac 3540 

cctcaacatc gcggtcgcca acggcctcgg caatgccaag aagctcatca agcagctggc 3600 

tgcaggcgag agcaagtacg acttcatcga ggtcatggcc tgccccggcg gctgcatcgg 3660 

cggcggcggc cagccgcgca gcgcggacaa gcagatcctg cagaagcgcc aggcggccat 3720 

gtacgacctg gacgagcgcg cggtgatccg gcgcagccac gagaacccgc tgattggcgc 3780 

gctgtatgag aagttcctgg gcgagcccaa cggccacaag gcgcacgagc tgctgcacac 3840 

gcactacgtg gccggcggcg tgcccgatga gaagtgaagc ggtggctggt gatgctggct 3900 

gcggcgaaga aacggtgggc atggtggtgg gtgggttgct gcatggtggt gtcgctcgtg 3960 

cagcatggtg ggtttgcggt tgtgatgttg ggcatgctgc acggaggtgt ttgcatggtt 4020 

atggatatgg ttcaggtgct gtgctgcgtc gcatgccata agcaccttgt gaccctgtgc 4080 

gatgcataaa aatagatatt gccatttggt tccaggctgg tggtggcagt ggctggttaa 4140 

caggggagtg tgtgtgtttg tgtgtcttca ttgtcggtgt gttcttgctg catgtattgt 4200 

agtgtaatgg gttatgcacg cctgcatgcg cacgcgctcc tcgtgctgcg acagtgcaca 4260 

acgcacagcg tgatacagct gcaggacgtt tgcggaaaaa cacttgttac tggtgacggc 4320 

tgaagcagcg atgatggaga gaatggattc gctgctatct cacagggcgt ggctgctgca 4380 

tcgccatggc atgtccctgt tgcacgcaat tgcctgcgta attttgatag tggcagcact 4440 

gaggcagctg caaggccttc tgccagcggc tgtttgtgtc ctatctgtgt ttacaggcag 4500 

ctgcatttga aggcaagggg gttggccatc actcactttg atcactcact ttgaagcagg 4560 

cttccatcca tgtattggtc aacgcactga agttcttttt ttgtcaccag gcagcagtat 4620 

tgtgtgcaca ctacttgcta tggagatgac agcagcatca atctcaagca tgatgaaagc 4680 

gtatgttgta tcagtgcccc attttgcaga ctcttaagag ctttaccttc tcaggggttg 4740 

cagcaggtgg tggtcagcca gttgagggag tgtgtggctg ttgtcttgcc accatgtgag 4800 

tattgaaacc accatcctga gctaagtgtt caggcatctt accctcatac cccgctaccc 4860 

tgctactggg agtttcgttt cattgtattg gcagccgttt actaattagt aatggcgctt 4920 

gagcgaggca tgtcttgata tgtatgcctt aggagagtgt gagctcaact caattctcat 4980

aagtgtaagc cacacaactg g 5001 
<212> Type : DNA 
<211> Length : 5001

SequenceName : Sequence # 1 

SequenceDescription : 

Sequence 



<213> OrganismName : Chlamydomonas reinhardtii
<400> PreSequenceString : 

gatgatatgg atcgtcgtct ggtgctcaag ctcatgggtt ttggctggcc gccgacgctg 60 

tcccggaagc accagcagca gcaggggcca gggggtcggg tgatgatgtg ggcgcggtgt 120 

atggaggtgg caccctgtat gttcatctgg gcgcttaatt gcgttaagcc attcgagccc 180 

acttcggagg caagttcgat ttggtggcgt gagatccgcc tcaccccggt tactgcacgt 240 

gcaggagtgg tgtgcagcag tagtcggcag ggtgtcccca ggtattgtgg cgtttgtcgc 300 

acggtatgcc ggtgcagtgc tcaggtgcgt aaagcggcgc gtcgcggtgt tggtcgcaac 360 

cggatgcttg aagccgaaat cgcttcgtcg gcacgtgtaa gtcattgttt gtttgggtga 420 
gctttgctgg cagcgtagaa ccgctgtggc ggacacacgc tcagcaaggg ccaagggggg 480

cgtccaagcc aaggtccaag cgcgcatccc ctcacccctg caccaatgtc caacaccgac 540 

agtaatccac gctccgctac gtcgcaagca ggcaatcatg cgtgtctaac atgactgaac 600 

tgccccgctg cccgtgaagg gcgatcgtca cgaagttttg tttgcatggt cgtatcggtg 660 

tatatgcgca cgcattgttg ccgacacaac ggacacaact atccggctgc ctactgttgt 720 

ataagggtca tagaatctag cgttatcctt ccacgagcgt gtggcagcct gctggcgtgg 780 

acgagctgtc atgcgttgtt ccgttatgtg tcgtcaaacg ccttcgagcg ctgcccggaa 840 

caatgcgtac tagtatagga gccatgaggc aagtgaacag aagcgggctg actggtcaag 900 

gcgcacgata gggctgacga gcgtgctgac ggggtgtacc gccgagtgtc cgctgcattc 960 

ccgccggatt gggaaatcgc gatggtcgcg cataggcaag ctcgcaaatg ctgtcagctt 1020 

atcttacatg aacacacaaa cactctcgca ggcactagcc tcaaaccctc gaaacctttt 1080 

tccaacagtt tacaccccaa ttcggacgcc gctccaagct cgctccgttg ctccttcatc 1140 

gcaccaccta ttatttctaa tatcgtagac gcgacaagat gtcggcgctc gtgctgaagc 1200 

cctgcgcggc cgtgtctatt cgcggcagct cctgcagggc gcggcaggtc gccccccgcg 1260 

ctccgctcgc agccagcacc gtgcgtgtag cccttgcaac acttgaggcg cccgcacgcc 1320 

gcctagggtg agggcgacgc agtgaacgca gtttcgatgg gtcactttgt cgcttttgcg 1380 

gaagcctccg aaacgtcccg cgaggttcaa acggccccga atgaccacac ccatatggcc 1440 

actgggaata ataacgcagc aacgtcgctt gcgcggctgc cgcacccgct gcggaggcgc 1500 

ctttgagtca tgtccagcag gcgctcgccg agcttggagc gaacggccga gcgagcgcgc 1560 

acgcattgtt gtggtcaagt ctctccactc agtccgaccc cccacacggc gtaggggtct 1620 

gaagtccacc aactcctcac acaccccaag gaagggacgt aagcccccct ggctacgctt 1680 

tacccagcag ccacagcgac agagcgcccc aacataggct cgagatagaa cgcacctgaa 1740 

ctgtgacact tacaatggaa aggaactgcg gatggcctta aagtcaagca ttttgtgacg 1800 

agtcggctcg gaatccccat cggcgcccgt ccgttcgtct tcatcaccgc ctgaaacggc 1860 

gcacgcgcaa tagtgcgcac ttgatgcctt tcggtccaac gcctctgtca gctaacactt 1920 

tccagggcca gcgcggactc gagaaccctc tttcctggca accttggttt ggctggacct 1980 

ggcaaccttg gtttggctgg caccaacctt gacccacata aatctctccc cccccccctt 2040 

atgcccacag ccaagcccaa ggacgacccc acgcgcaagc acgtctgcgt gcaggtggct 2100 

ccggccgttc gtgtcgctat tgccgagacc ctgggcctgg cgccgggcgc caccaccccc 2160 

aagcagctgg ccgagggcct ccgccgcctc ggctttgacg aggtaggtgc gctcgctgct 2220 

gcagtgccca acacgcatct tccagctcac cgcctccagt cagcaccttg gcatgcatgc 2280 

ttggcgcatc tgccgcctca ttgccgcctc gcggcctcgc cgctgcctgc atcaagcctg 2340 

cctgcctgcc tgcccgccct cacgccccag gtgtttgaca cgctgtttgg cgccgacctg 2400 

accatcatgg aggagggcag cgagctgctg caccgcctca ccgagcacct ggaggcccac 2460 

ccgcactccg acgagccgct gcccatgttc accagctgct gccccggctg gatcggtagc 2520 

agcgcggcgt gcttgcttag ggccccataa cctgtcttgg gcccccggcg tccgcctctc 2580 

cacctacctg caacatgtac gtgcctacgg tattgtcgca tgtctcttga cgatttgggt 2640 

cgaccttacc tttgccttgt gtcctttctc cacccccacc cgcctctttc ctcgccggcc 2700

cccctcgcgc agctatgctg gagaaatctt acccggacct gatcccctac gtgagcagct 2760 

gcaagagccc ccagatgatg ctggcggcca tggtcaagtc ctacctagcg gaaaagaagg 2820 

gcatcgcgcc aaaggacatg gtcatggtgt ccatcatgcc ctgcacgcgc aagcagtcgg 2880 

aggctgaccg cgactggttc tgtgtggacg ccgaccccac cctgcgccag ctggaccacg 2940 

tcatcaccac cgtggagctg ggcaacatct tcaaggagcg cggcatcaac ctggccgagc 3000 

tgcccgaggg cgagtgggac aatccaatgg gcgtgggctc gggcgccggc gtgctgttcg 3060 

gcaccaccgg cggtgtcatg gaggcggcgc tgcgcacggt gggtctgtga gagccggttg 3120 

attggcccgg cagaacgcat acacttgctg aacctttgat gcgggataag caaggctacc 3180 

gatccgcgtc tttttacacc tgtttatcac gtcgctgagc aagctcgtga cacctgcagg 3240 

cctatgagct gttcacgggc acgccgctgc cgcgcctgag cctgagcgag gtgcgcggca 3300 

tggacggcat caaggagacc aacatcacca tggtgcccgc gcccgggtcc aagtttgagg 3360 

agctgctgaa gcaccgcgcc gccgcgcgcg ccgaggccgc cgcgcacggc acccccgggc 3420 

cgctggcctg ggacggcggc gcgggcttca ccagcgagga cggcaggggc ggcatcacac 3480 

tgcgcgtggc cgtggccaac gggctgggca acgccaagaa gctgatcacc aagatgcagg 3540 

ccggcgaggc caagtacgac tttgtggaga tcatggcctg ccccgcgggc tgtgtgggcg 3600 

gcggcggcca gccccgctcc accgacaagg ccatcacgca gaagcggcag gcggcgctgt 3660 

acaacctgga cgagaagtga gcgggcggcg ctgctgggat tgggcagggg agggaaggga 3720 

ctgcggggca gggtgcggcg ggaaacggaa atgggcaagg ctcgaggtgg agggcggggt 3780 

gggttggggt tacttgctac aggttggcgg gcaggatgtg atggaagcag tgtggaggag 3840 

gtgtgcgtag ggtcccgacg acggtattcg cacgagcaaa gagggtcggc acttcctgac 3900 

acaatgtgcg cctgcacgtg cgctcctgtt gctgccccag gtccacgctg cgccgcagcc 3960 

acgagaaccc gtccatccgc gagctgtacg acacgtacct cggagagccg ctgggccaca 4020 

aggtgggggg ggttgtatac taccagccca aatgacgggg ctggtcgggg gcgttggaga 4080 

ggcgggccgg gagggaggcg ggctgggtgt ggggcaacag caggtgaagg gacggggggg 4140 

cgcactgggc agggcggtac atgccttgtc ctgatagcta cccacacgcg actgttgcta 4200 
catggatgca tgacgtgtgc cgtgtgcttg acccctgcag gcgcacgagc tgctgcacac 4260

ccactacgtg gccggcggcg tggaggagaa ggacgagaag aagtgaggag cgccagaggc 4320 

tctttgggcg gagacagctt caaagcgagg gggcgtatta gcagtaccgt aaatatgcac 4380 

tgatgggtga tgcgggtgtc ctcctttata ttgaatgggg tcaaaatagg cggcgggtca 4440 

aatgtttcct ttttgagtgg tgtcacagca tggggcacgt gtgcggaggc cagttgccct 4500 

ccagtgcacg cgctcccggt gtgtggccgc actggccttg gataatgcac cggtggagga 4560 

ttatggaaga gggggactca gaaggctcat tattggacaa tgcctggtct cttccacatt 4620 

ggtgtgagcg cggctccgca taggctgttc actgcacgct ggcattaggc gtaggtactg 4680 

gcatgaggga gcgcggcttg ctaaccgaat ggcgtatccc tccagggcac gtcggaatgg 4740 

cgcgtgccca tcaacgcaaa ttcttggcct tcatcgcttc tggatattga agctgcacaa 4800 

acctgcattc tatttgcttg tttacacgtg ccccaatctt ggttggaagc taaacatgtt 4860 

tgggaacaat tcatcttact aaagcgtgtg ggggttgagg atgcgcacgt tgtgcgctgg 4920 

tgggtgggcg ggaacgtggg tagcatttag gctagctggc atacgacaac ggggcccgtg 4980 

aggattgagc acttgactcg cgaacttatg aacgtagcgc tttataccca ccgtatgcga 5040 

ttgacgttgg tgtaggcaac caggcggtag gaaggcggag agatgcattg caaacgcctg 5100 

taaaagaacg gcatagctac tagacactct gatgtggacc cttggcgcag ccacgacagg 5160 

agaggtgtgc atcagccgct tgtaagcacg cacttctgag aaaaaaaa 5208 
<212> Type : DNA 
<211> Length : 5208 

SequenceName : Sequence # 2 

SequenceDescription :

Sequence 



<213> OrganismName : Chlorella fusca
<400> PreSequenceString : 

gcggaattac tagtgataag cagtggtaac aacgcagagt cgcgggcagg gactcgatca 60 

gttgttatgt gttgccccgt ggttgcaagt aggcacgcag ggcgtgcaag gcatgttgct 120 

gtccgtgcag cagggccaac atctgagtgt gattgtcctc caacacctca ggccaagctg 180 

cctcactggc agcaggctct ggatgagctc gccaagccca aggagagcag gaggttgatg 240 

atcgcgcaaa tcgcctccgc tgttcgtgtc gctattgctg agaccattgg cttggcccca 300 

ggagatgtca ccattgggca gctcgtgact gggctgcgta tgcttggctt tgattatgtc 360 

tttggtaagc agcagcatct tgcattacac ttgcagttgg tcgtcacatg cacctaatca 420 

gatgttagcc ctctggaaca tttttgcctg tttggtgctt acctgaccaa ctgctgcctg 480 

gtatggccaa cttgtgaagc tgcgtgtgtt ggcgttgcta cagacaccct gtttggtgct 540 

gacctgacca ttatggagga gggaacggag ctgctgcatc gcctgcagga ccatctggag 600 

cagcacccca acaaggaggt gagtaagcca gctgggtggt ctaccaccca gcaccagctc 660 

gagacagcag ccttgcatca acactcacaa cgtctagctc ctccttaaat gagcggacca 720 

aacctgtgag tggcaccatg tcagctgccc ctcgcaccaa agcacagcat ggcctgtctg 780 

tcgtcgattg ccacatgagt gtttgcgttg ttatgcaagt gcctgaacaa actgcatatt 840 

cctgtgtctc tctgcgttcg cacaggagcc actgcccatg ttcaccagtt gctgcccagg 900 

ctgggttgcc atggttgaaa agagcaatcc tgagctcatc ccctacctgt catcttgcaa 960 

gtcgcctcag atgatgcttg gggccgttat caagaactac tatgcacagc aggttggagt 1020 

gcagcccagt gacatctgca acgtgtcagt catgccatgc gtacgcaagc agggagaggc 1080 

tgaccgggag tggttcaaca ccacaggtgg gcgcaggcag tgtatcacca gtactggtgt 1140 

tctccgtgtg ttgtcagtgt gtctgttaga ggctggatac tctccagtgc agtgctgatg 1200 

cagagtggcg gctggtgtgc agcagcgacc ccaagaacac tgagagctgg caattcaatg 1260 

ggcttgcttg cttactgtca gcttcctttt cctgcaggtg cagtgacata cggtctgcat 1320 

caaggctcaa acatgttgtg tatgtatgtg tgatgttgca attgcaggcc ttgcccgtga 1380 

tgttgatcat gtggtgacta ctgctgaggt tggtaagata ttcctggagc gtggcatcaa 1440 

gctgaatgag ctgccagaga gcaactttga caaccccatt ggcgagggca caggtggtgc 1500 

tctgctgttt ggcaccactg gaggtgtcat ggaggcagca cttcgcacag tctatgaagt 1560 

ggtgagtggt actgcttcag tttcagtcag tgtaccaacc aagctactgc aattgcatag 1620 

cgccagtttt ctgccatcaa tgacctgctt tgtaagtagc tgatacttta ccaaccactg 1680 

gtatttgtgg ttatcctgcc atagcacatg ccttctcctg ctgttggctt tatcaacctg 1740 

ttggtctatg tgtcactgct gtgctgcagg ttacccagaa gcccatgggt cgtgttgact 1800 

ttgaggaggt gcgaggcctt gaaggaatca aggaggcaga gatcacactc aagccaggag 1860 

acgacagccc attcaaagcc ttcgcaggag ctgatgggca gggcatcacg ctcaagattg 1920 

cagtagccaa tgggcttggc aatgccaaga agctcatcaa gagcctgtca gagggcaagg 1980 

ccaagtatga tttcattgag gtcatggcat gccctggtgg ctgcattggc ggaggcggtc 2040 

agccccgcag tactgacaag cagatcctgc agaagcgcca gcaggctatg tacaacctgg 2100 

atgagcgcag taccatccgc cgcagccatg ataacccatt catccaggcg ctgtatgaca 2160 

agttcctagg cgcacccaac agccacaagg cacatgatct gctgcacaca cactatgtgg 2220 
caggtggaat tccagaggag aagtgaggga ccgaggccgg agtggtgtta ttagtgtaga 2280
gctaggcagc agggatctgg ccgcatttgg gtgctgttgt ttggtttggc atcaaagata 2340
tgatgaatgt acaatctatt gggttctttg tatctcattc atgactgctg cttggtgagg 2400
tatgggccag gaagaagccc gcatcaatgc atgtgaacta ggtggctcca catatgaacc 2460
ctatctggat gtttaaggta cctgaaacaa tagtgcatcg gctctgcatg gctcaacaac 2520
ctgtcttcag agcaggtgta ttccacacca tcttgattta cctaccactc tgtagttcaa 2580
gtggtcaaat tgaatgtcta tggcagctac gcctgcagtt catagtctat gaaggtttca 2640
ccagagtcca tgtccctcat attttttgtt ttatatgcct tgattatgcc ccttgaacca 2700
tgctcaatgc acacaagttg gtcgcaggac aggcggcatc gtacatctca attttcagaa 2760
cttgtcagtg cggcattgcc ttatttgtac tcttgcagtc ctgtttcacc cttgctactg 2820
ccttgcatgc atcttgtttt tgcaagcaac agctcatgca ttgcaatcga tcatcacgta 2880
catccgtgcc atattcacat ggttttgact tgcaaatcaa ccaccaggca gtgggtaaat 2940
tgccaggctg ggtgcacttt gggccatttg ggcagccctc ttgtggcgag ctttgctgca 3000
gggccaagct gagtgcatca gactcagcag gctgctgctg gcactgtaga atgctgaaaa 3060
gggcattcaa ctacatgtca ttattaggtt gacctgagac agccgtaaga atatcattgt 3120
gtgctgaact tagtcgtcaa tgtcatgcca tgatgtgtgt ttcagggatg gataagggag 3180
gtccttcctc aattacatgc ctttcaagag acttcaatat ctgttgtcag tgacttgttt 3240
gtgtttgctt aatccagtgg ttctc 3265
<212> Type : DNA 
<211> Length : 3265 

SequenceName : Sequence # 3

SequenceDescription : 

Sequence 

<213> OrganismName : Scenedesmus obliquus
<400> PreSequenceString :

MPEWQPGGRY AVSVRPPVNR RAVVAAERRR LVVRAAGPTA ECDCPPAPAP KAPHWQQTLD 60
ELAKPKEQRK VMIAQIAPAV RVAIAETMGL NPGDVTVGQM VTGLRMLGFD YVFDTLFGAD 120
LTIMEEGTEL LHRLQDHLEQ HPNKEEPLPM FTSCCPGWVA MVEKSNPELI PYLSSCKSPQ 180
MMLGAVIKNY FAAEAGAKPE DICNVSVMPC VRKQGEADRE WFNTTGAGGA NVDHVMTTAE 240
LGKIFVERGI KLNDLQETPF DNPVGEGSGG VLFGTTGGVM EAALRTVYEV VTQKPLDRIV 300
FEDVRGLEGI KESTLHLTPG PTSPFKAFAG ADGTGITLNI AVANGLGNAK KLIKQLAAGE 360
SKYDFIEVMA CPGGCIGGGG QPRSADKQIL QKRQAAMYDL DERAVIRRSH ENPLIGALYE 420
KFLGEPNGHK AHELLHTHYV AGGVPDEK 448
<212> Type : PRT 
<211> Length : 448 

SequenceName : Sequence # 4 

SequenceDescription : 






Sequence 

<213> OrganismName : Chlamydomonas reinhardtii
<400> PreSequenceString :

MSALVLKPCA AVSIRGSSCR ARQVAPRAPL AASTVRVALA TLEAPARRLG NVACAAAAPA 60
AEAPLSHVQQ ALAELAKPKD DPTRKHVCVQ VAPAVRVAIA ETLGLAPGAT TPKQLAEGLR 120
RLGFDEVFDT LFGADLTIME EGSELLHRLT EHLEAHPHSD EPLPMFTSCC PGWIAMLEKS 180
YPDLIPYVSS CKSPQMMLAA MVKSYLAEKK GIAPKDMVMV SIMPCTRKQS EADRDWFCVD 240
ADPTLRQLDH VITTVELGNI FKERGINLAE LPEGEWDNPM GVGSGAGVLF GTTGGVMEAA 300
LRTAYELFTG TPLPRLSLSE VRGMDGIKET NITMVPAPGS KFEELLKHRA AARAEAAAHG 360
TPGPLAWDGG AGFTSEDGRG GITLRVAVAN GLGNAKKLIT KMQAGEAKYD FVEIMACPAG 420
CVGGGGQPRS TDKAITQKRQ AALYNLDEKS TLRRSHENPS IRELYDTYLG EPLGHKAHEL 480
LHTHYVAGGV EEKDEKK 497
<212> Type : PRT 
<211> Length : 497 

SequenceName : Sequence # 5 

SequenceDescription : 

Sequence 

<213> OrganismName : Chlorella fusca
<400> PreSequenceString :

MCCPVVASRH AGRARHVAVR AAGPTSECDC PPTPQAKLPH WQQALDELAK PKESRRLMIA 60 
QIASAVRVAI AETIGLAPGD VTIGQLVTGL RMLGFDYVFD TLFGADLTIM EEGTELLHRL 120 
QDHLEQHPNK EEPLPMFTSC CPGWVAMVEK SNPELIPYLS SCKSPQMMLG AVIKNYYAQQ 180 
VGVQPSDICN VSVMPCVRKQ GEADREWFNT TGAGLARDVD HVVTTAEVGK IFLERGIKLN 240
ELPESNFDNP IGEGTGGALL FGTTGGVMEA ALRTVYEVVT QKPMGRVDFE EVRGLEGIKE 300 
AEITLKPGDD SPFKAFAGAD GQGITLKIAV ANGLGNAKKL IKSLSEGKAK YDFIEVMACP 360 
GGCIGGGGQP RSTDKQILQK RQQAMYNLDE RSTIRRSHDN PFIQALYDKF LGAPNSHKAH 420 
DLLHTHYVAG GIPEEK 436 
<212> Type : PRT 
<211> Length : 436 

SequenceName : Sequence # 6 

SequenceDescription : 

Sequence 



<213> OrganismName : Scenedesmus obliquus 
<400> PreSequenceString : 

acaacagagc gttagagata cttcatagct gcaactagac tacctttacc ctaacgaaat 60 

caccctagac cgacagtgtc ggagtagctg cgacccaaac gtgatggcga gcggattgct 120 

tctcaagcag cgctcggtat gcctgagtgg caaccgggag gtcggtatgc tgtttctgtc 180 

cgcccgccag tgaacaggcg ggctgtggtg gcagcagagc gcaggcgcct tgttgtgcgg 240 

gcagctggcc caacagcaga atgtgattgc ccaccagctc ccgcgcccaa ggccccgcac 300 

tggcagcaga cgctagatga gctagccaag cctaaggagc agcgcaaggt gatgatcgcc 360

cagatcgcac cagcagtgcg cgtggctatt gcagagacca tgggactcaa ccctggggat 420 

gtgacagttg gccagatggt gaccggcctg cgcatgctgg gctttgatta tgtgtttgac 480 

acgctgtttg gtgctgacct caccatcatg gaggagggca cagagctact gcacaggctt 540 

caggaccacc tggagcagca ccccaacaag gaggagccgc tgcccatgtt caccagctgc 600 

tgccctggct gggtggccat ggtggagaag tccaaccccg agctcatccc ctacctgtct 660 

tcctgcaagt cgccccagat gatgctgggc gcagtcatca agaactactt cgctgccgag 720 

gccggcgcca agcctgagga catctgcaac gtgagcgtga tgccctgcgt gcgcaagcag 780 

ggcgaggctg accgcgagtg gttcaacacc acaggggctg gcggcgcgaa cgtggaccac 840 

gtcatgacaa ctgcagagct gggcaagatc tttgtggagc gcggaatcaa gctgaacgac 900 

ctgcaggaga cgccctttga caaccccgtc ggcgagggca gcggcggcgt actgttcggc 960 

accactggag gcgtgatgga ggcggcgctg cgcaccgtgt acgaagtggt cacacagaag 1020 

cctttggacc gcatcgtctt tgaggacgtg cgcggcctgg agggcatcaa ggagtccacg 1080 

ctgcacctca ccccaggccc caccagcccc ttcaaggcct ttgcaggcgc agacggcacc 1140 

ggcatcaccc tcaacatcgc ggtcgccaac ggcctcggca atgccaagaa gctcatcaag 1200 

cagctggctg caggcgagag caagtacgac ttcatcgagg tcatggcctg ccccggcggc 1260 

tgcatcggcg gcggcggcca gccgcgcagc gcggacaagc agatcctgca gaagcgccag 1320 

gcggccatgt acgacctgga cgagcgcgcg gtgatccggc gcagccacga gaacccgctg 1380 

attggcgcgc tgtatgagaa gttcctgggc gagcccaacg gccacaaggc gcacgagctg 1440 

ctgcacacgc actacgtggc cggcggcgtg cccgatgaga agtgaagcgg tggctggtga 1500 

tgctggctgc ggcgaagaaa cggtgggcat ggtggtgggt gggttgctgc atggtggtgt 1560 

cgctcgtgca gcatggtggg tttgcggttg tgatgttggg catgctgcac ggaggtgttt 1620 

gcatggttat ggatatggtt caggtgctgt gctgcgtcgc atgccataag caccttgtga 1680 

ccctgtgcga tgcataaaaa tagatattgc catttggttc caggctggtg gtggcagtgg 1740 

ctggttaaca ggggagtgtg tgtgtttgtg tgtcttcatt gtcggtgtgt tcttgctgca 1800 

tgtattgtag tgtaatgggt tatgcacgcc tgcatgcgca cgcgctcctc gtgctgcgac 1860 

agtgcacaac gcacagcgtg atacagctgc aggacgtttg cggaaaaaca cttgttactg 1920 

gtgacggctg aagcagcgat gatggagaga atggattcgc tgctatctca cagggcgtgg 1980 

ctgctgcatc gccatggcat gtccctgttg cacgcaattg cctgcgtaat tttgatagtg 2040 

gcagcactga ggcagctgca aggccttctg ccagcggctg tttgtgtcct atctgtgttt 2100 

acaggcagct gcatttgaag gcaagggggt tggccatcac tcactttgat cactcacttt 2160 

gaagcaggct tccatccatg tattggtcaa cgcactgaag ttcttttttt gtcaccaggc 2220 

agcagtattg tgtgcacact acttgctatg gagatgacag cagcatcaat ctcaagcatg 2280

atgaaagcgt atgttgtatc agtgccccat tttgcagact cttaagagct ttaccttctc 2340 

aggggttgca gcaggtggtg gtcagccagt tgagggagtg tgtggctgtt gtcttgccac 2400 

catgtgagta ttgaaaccac catcctgagc taagtgttca ggcatcttac cctcataccc 2460 

cgctaccctg ctactgggag tttcgtttca ttgtattggc agccgtttac taattagtaa 2520 

tggcgcttga gcgaggcatg tcttgatatg tatgccttag gagagtgtga gctcaactca 2580 

attctcataa gtgtaagcca cacaactgga aaaaaaaaaa aaaaaaaaaa aaaaaa 2636 

<212> Type : RNA 
<211> Length : 2636 
SequenceName : Sequence # 7 
SequenceDescription : 

Sequence 



<213> OrganismName : Chlamydomonas reinhardtii

<400> PreSequenceString : 

ATCTTACATG AACACACAAA CACTCTCGCA GGCACTAGCC TCAAACCCTC GAAACCTTTT 60 

TCCAACAGTT TACACCCCAA TTCGGACGCC GCTCCAAGCT CGCTCCGTTG CTCCTTCATC 120 

GCACCACCTA TTATTTCTAA TATCGTAGAC GCGACAAGAT GTCGGCGCTC GTGCTGAAGC 180 

CCTGCGCGGC CGTGTCTATT CGCGGCAGCT CCTGCAGGGC GCGGCAGGTC GCCCCCCGCG 240 

CTCCGCTCGC AGCCAGCACC GTGCGTGTAG CCCTTGCAAC ACTTGAGGCG CCCGCACGCC 300 

GCCTAGGCAA CGTCGCTTGC GCGGCTGCCG CACCCGCTGC GGAGGCGCCT TTGAGTCATG 360 

TCCAGCAGGC GCTCGCCGAG CTTGCCAAGC CCAAGGACGA CCCCACGCGC AAGCACGTCT 420 

GCGTGCAGGT GGCTCCGGCC GTTCGTGTCG CTATTGCCGA GACCCTGGGC CTGGCGCCGG 480 

GCGCCACCAC CCCCAAGCAG CTGGCCGAGG GCCTCCGCCG CCTCGGCTTT GACGAGGTGT 540 

TTGACACGCT GTTTGGCGCC GACCTGACCA TCATGGAGGA GGGCAGCGAG CTGCTGCACC 600 

GCCTCACCGA GCACCTGGAG GCCCACCCGC ACTCCGACGA GCCGCTGCCC ATGTTCACCA 660 

GCTGCTGCCC CGGCTGGATC GCTATGCTGG AGAAATCTTA CCCGGACCTG ATCCCCTACG 720 

TGAGCAGCTG CAAGAGCCCC CAGATGATGC TGGCGGCCAT GGTCAAGTCC TACCTAGCGG 780 

AAAAGAAGGG CATCGCGCCA AAGGACATGG TCATGGTGTC CATCATGCCC TGCACGCGCA 840 

AGCAGTCGGA GGCTGACCGC GACTGGTTCT GTGTGGACGC CGACCCCACC CTGCGCCAGC 900 

TGGACCACGT CATCACCACC GTGGAGCTGG GCAACATCTT CAAGGAGCGC GGCATCAACC 960 

TGGCCGAGCT GCCCGAGGGC GAGTGGGACA ATCCAATGGG CGTGGGCTCG GGCGCCGGCG 1020 

TGCTGTTCGG CACCACCGGC GGTGTCATGG AGGCGGCGCT GCGCACGGCC TATGAGCTGT 1080 

TCACGGGCAC GCCGCTGCCG CGCCTGAGCC TGAGCGAGGT GCGCGGCATG GACGGCATCA 1140 

AGGAGACCAA CATCACCATG GTGCCCGCGC CCGGGTCCAA GTTTGAGGAG CTGCTGAAGC 1200 

ACCGCGCCGC CGCGCGCGCC GAGGCCGCCG CGCACGGCAC CCCCGGGCCG CTGGCCTGGG 1260 

ACGGCGGCGC GGGCTTCACC AGCGAGGACG GCAGGGGCGG CATCACACTG CGCGTGGCCG 1320 

TGGCCAACGG GCTGGGCAAC GCCAAGAAGC TGATCACCAA GATGCAGGCC GGCGAGGCCA 1380 

AGTACGACTT TGTGGAGATC ATGGCCTGCC CCGCGGGCTG TGTGGGCGGC GGCGGCCAGC 1440 

CCCGCTCCAC CGACAAGGCC ATCACGCAGA AGCGGCAGGC GGCGCTGTAC AACCTGGACG 1500 

AGAAGTCCAC GCTGCGCCGC AGCCACGAGA ACCCGTCCAT CCGCGAGCTG TACGACACGT 1560 

ACCTCGGAGA GCCGCTGGGC CACAAGGCGC ACGAGCTGCT GCACACCCAC TACGTGGCCG 1620 

GCGGCGTGGA GGAGAAGGAC GAGAAGAAGT GAGGAGCGCC AGAGGCTCTT TGGGCGGAGA 1680 

CAGCTTCAAA GCGAGGGGGC GTATTAGCAG TACCGTAAAT ATGCACTGAT GGGTGATGCG 1740 

GGTGTCCTCC TTTATATTGA ATGGGGTCAA AATAGGCGGC GGGTCAAATG TTTCCTTTTT 1800

GAGTGGTGTC ACAGCATGGG GCACGTGTGC GGAGGCCAGT AGGCTGTTCA CTGCACGCTG 1860 

GCATTAGGCG TAGGTACTGG CATGAGGGAG CGCGGCTTGC TAACCGAATG GCGTATCCCT 1920 

CCAGGGCACG TCGGAATGGC GCGTGCCCAT CAACGCAAAT TCTTGGCCTT CATCGCTTCT 1980 

GGATATTGAA GCTGCACAAA CCTGCATTCT ATTTGCTTGT TTACACGTGC CCCAATCTTG 2040 

GTTGGAAGCT AAACATGTTT GGGAACAATT CATCTTACTA AAGCGTGTGG GGGTTGAGGA 2100 

TGCGCACGTT GTGCGCTGGT GGGTGGGCGG GAACGTGGGT AGCATTTAGG CTAGCTGGCA 2160 

TACGACAACG GGGCCCGTGA GGATTGAGCA CTTGACTCGC GAACTTATGA ACGTAGCGCT 2220 

TTATACCCAC CGTATGCGAT TGACGTTGGT GTAGGCAACC AGGCGGTAGG AAGGCGGAGA 2280 

GATGCATTGC AAACGCCTGT AAAAGAACGG CATAGCTACT AGACACTCTG ATGTGGACCC 2340 

TTGGCGCAGC CACGACAGGA GAGGTGTGCA TCAGCCGCTT GTAAGCACGC ACTTCTGAG 2399 

<212> Type : PRT 
<211> Length : 2399 

SequenceName : Sequence # 8 

SequenceDescription : 

Sequence 



<213> OrganismName : Chlorella fusca 
<400> PreSequenceString : 

gcggaattac tagtgataag cagtggtaac aacgcagagt cgcgggcagg gactcgatca 60 

gttgttatgt gttgccccgt ggttgcaagt aggcacgcag ggcgtgcaag gcatgttgct 120 

gtccgtgcag cagggccaac atctgagtgt gattgtcctc caacacctca ggccaagctg 180 

cctcactggc agcaggctct ggatgagctc gccaagccca aggagagcag gaggttgatg 240 

atcgcgcaaa tcgcctccgc tgttcgtgtc gctattgctg agaccattgg cttggcccca 300 

ggagatgtca ccattgggca gctcgtgact gggctgcgta tgcttggctt tgattatgtc 360 

tttgacaccc tgtttggtgc tgacctgacc attatggagg agggaacgga gctgctgcat 420 

cgcctgcagg accatctgga gcagcacccc aacaaggagg agccactgcc catgttcacc 480 

agttgctgcc caggctgggt tgccatggtt gaaaagagca atcctgagct catcccctac 540 

ctgtcatctt gcaagtcgcc tcagatgatg cttggggccg ttatcaagaa ctactatgca 600 

cagcaggttg gagtgcagcc cagtgacatc tgcaacgtgt cagtcatgcc atgcgtacgc 660 

aagcagggag aggctgaccg ggagtggttc aacaccacag gtgcaggcct tgcccgtgat 720 

gttgatcatg tggtgactac tgctgaggtt ggtaagatat tcctggagcg tggcatcaag 780 

ctgaatgagc tgccagagag caactttgac aaccccattg gcgagggcac aggtggtgct 840 

ctgctgtttg gcaccactgg aggtgtcatg gaggcagcac ttcgcacagt ctatgaagtg 900 

gtgacccaga agcccatggg tcgtgttgac tttgaggagg tgcgaggcct tgaaggaatc 960 

aaggaggcag agatcacact caagccagga gacgacagcc cattcaaagc cttcgcagga 1020 

gctgatgggc agggcatcac gctcaagatt gcagtagcca atgggcttgg caatgccaag 1080 

aagctcatca agagcctgtc agagggcaag gccaagtatg atttcattga ggtcatggca 1140 

tgccctggtg gctgcattgg cggaggcggt cagccccgca gtactgacaa gcagatcctg 1200 

cagaagcgcc agcaggctat gtacaacctg gatgagcgca gtaccatccg ccgcagccat 1260

gataacccat tcatccaggc gctgtatgac aagttcctag gcgcacccaa cagccacaag 1320 

gcacatgatc tgctgcacac acactatgtg gcaggtggaa ttccagagga gaagtgaggg 1380 

accgaggccg gagtggtgtt attagtgtag agctaggcag cagggatctg gccgcatttg 1440 

ggtgctgttg tttggtttgg catcaaagat atgatgaatg tacaatctat tgggttcttt 1500 

gtatctcatt catgactgct gcttggtgag gtatgggcca ggaagaagcc cgcatcaatg 1560 

catgtgaact aggtggctcc acatatgaac cctatctgga tgtttaaggt acctgaaaca 1620 

atagtgcatc ggctctgcat ggctcaacaa cctgtcttca gagcaggtgt attccacacc 1680 

atcttgattt acctaccact ctgtagttca agtggtcaaa ttgaatgtct atggcagcta 1740 

cgcctgcagt tcatagtcta tgaaggtttc accagagtcc atgtccctca tattttttgt 1800 

tttatatgcc ttgattatgc cccttgaacc atgctcaatg cacacaagtt ggtcgcagga 1860 

caggcggcat cgtacatctc aattttcaga acttgtcagt gcggcattgc cttatttgta 1920 

ctcttgcagt cctgtttcac ccttgctact gccttgcatg catcttgttt ttgcaagcaa 1980 

cagctcatgc attgcaatcg atcatcacgt acatccgtgc catattcaca tggttttgac 2040 

ttgcaaatca accaccaggc agtgggtaaa ttgccaggct gggtgcactt tgggccattt 2100 

gggcagccct cttgtggcga gctttgctgc agggccaagc tgagtgcatc agactcagca 2160 

ggctgctgct ggcactgtag aatgctgaaa agggcattca actacatgtc attattaggt 2220 

tgacctgaga cagccgtaag aatatcattg tgtgctgaac ttagtcgtca atgtcatgcc 2280 

atgatgtgtg tttcagggat ggataaggga ggtccttcct caattacatg cctttcaaga 2340 

gacttcaata tctgttgtca gtgacttgtt tgtgtttgct taatccagtg gttctcaaaa 2400 

aaaaaaaaaa aaaaaaaaaa a 2421 
<212> Type : RNA 
<211> Length : 2421 

SequenceName : Sequence # 9 

SequenceDescription : 